A deep learning framework for modeling structural features of RNA-binding protein targets.

نویسندگان

  • Sai Zhang
  • Jingtian Zhou
  • Hailin Hu
  • Haipeng Gong
  • Ligong Chen
  • Chao Cheng
  • Jianyang Zeng
چکیده

RNA-binding proteins (RBPs) play important roles in the post-transcriptional control of RNAs. Identifying RBP binding sites and characterizing RBP binding preferences are key steps toward understanding the basic mechanisms of the post-transcriptional gene regulation. Though numerous computational methods have been developed for modeling RBP binding preferences, discovering a complete structural representation of the RBP targets by integrating their available structural features in all three dimensions is still a challenging task. In this paper, we develop a general and flexible deep learning framework for modeling structural binding preferences and predicting binding sites of RBPs, which takes (predicted) RNA tertiary structural information into account for the first time. Our framework constructs a unified representation that characterizes the structural specificities of RBP targets in all three dimensions, which can be further used to predict novel candidate binding sites and discover potential binding motifs. Through testing on the real CLIP-seq datasets, we have demonstrated that our deep learning framework can automatically extract effective hidden structural features from the encoded raw sequence and structural profiles, and predict accurate RBP binding sites. In addition, we have conducted the first study to show that integrating the additional RNA tertiary structural features can improve the model performance in predicting RBP binding sites, especially for the polypyrimidine tract-binding protein (PTB), which also provides a new evidence to support the view that RBPs may own specific tertiary structural binding preferences. In particular, the tests on the internal ribosome entry site (IRES) segments yield satisfiable results with experimental support from the literature and further demonstrate the necessity of incorporating RNA tertiary structural information into the prediction model. The source code of our approach can be found in https://github.com/thucombio/deepnet-rbp.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Identification and Evaluation of Novel Drug Targets against the Human Fungal Pathogen Aspergillus fumigatus with Elaboration on the Possible Role of RNA-Binding Protein

Bakground: Aspergillus fumigatus is an airborne opportunistic fungal pathogen that can cause fatal infections in immunocompromised patients. Although the current anti-fungal therapies are relatively efficient, some issues such as drug toxicity, drug interactions, and the emergence of drug-resistant fungi have promoted the intense research toward finding the novel drug targets. Methods: In searc...

متن کامل

Investigation the Mechanism of Interaction between Inhibitor ALISERTIB with Protein Kinase A and B Using Modeling, Docking and Molecular Dynamics Simulation

The high level of conservation in ATP-binding sites of protein kinases increasingly demandsthe quest to find selective inhibitors with little cross reactivity. Kinase kinases are a recently discovered group of Kinases found to be involved in several mitotic events. These proteins represent attractive targets for cancer therapy with several small molecule inhibitors undergoing different ph...

متن کامل

Identification of RNA-binding sites in artemin based on docking energy landscapes and molecular dynamics simulation

There are questions concerning the functions of artemin, an abundant stress protein found in Artemiaduring embryo development. It has been reported that artemin binds RNA at high temperatures in vitro, suggesting an RNA protective role. In this study, we investigated the possibility of the presence of RNA-bindingsites and their structural properties in artemin, using docking energy ...

متن کامل

Quantifying sequence and structural features of protein–RNA interactions

Increasing awareness of the importance of protein-RNA interactions has motivated many approaches to predict residue-level RNA binding sites in proteins based on sequence or structural characteristics. Sequence-based predictors are usually high in sensitivity but low in specificity; conversely structure-based predictors tend to have high specificity, but lower sensitivity. Here we quantified the...

متن کامل

Investigating the Mechanism of Action of SARS-CoV-2 Virus for Drug Designing: A Review

Coronavirus Disease 2019 (COVID-19) is a viral pneumonia emerged in December 2019 in Wuhan, China. Its cause is a new virus from the coronavirus family scientifically named Coronavirus Acute Respiratory Syndrome 2 (SARS-CoV-2). In this review study, articles published in English until March 23, 2020 on new coronavirus infection were reviewed. These articles are obtained by searching in PubMed, ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Nucleic acids research

دوره 44 4  شماره 

صفحات  -

تاریخ انتشار 2016